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Abstract 

We describe a visual programming language for defining be¬ 
haviors manifested by reified actors in a 2D virtual world that 
can be compiled into programs comprised of sequences of 
combinators that are themselves reified as actors. This makes 
it possible to build programs that build programs from com¬ 
ponents of a few fixed types delivered by diffusion using pro¬ 
cesses that resemble chemistry as much as computation. 

Introduction 

Self-replicating programs have been defined using compu¬ 
tational models that vary in expressiveness and verisimili¬ 
tude. If we adopt the definition used in the field of pro¬ 
gramming languages ( Felleisen} 1990| l, then expressiveness 
varies along a spectrum that begins with cellular automata 
(CA) defined using lookup tables, increases with artificial 
chemistries based on symbol rewrite rules, and peaks in 
(more or less) conventional programming languages (which 
themselves vary along a spectrum that begins with machine 
language and ends in high-level languages like Lisp). 

By verisimilitude, we mean providing an interface with 
the affordances and limitations of a natural physics. Models 
with high verisimilitude define virtual worlds. Because CAs 
are spatially embedded and governed by simple rules defined 
on local neighborhoods, most would say that the verisimil¬ 
itude of CAs is high. However, since state is updated ev¬ 
erywhere synchronously, and this (unlike a natural physics) 
requires a global clock, CAs are not indefinitely scalable 
( Ackley! 2013| ). Because asynchronous cellular automata 
(ACA) do not suffer from this limitation yet are just as pow¬ 
erful (|Nakamur^|1974t[Berman and Simon[|1988[|Nehaniv| 


20041, AC As are the gold standard in virtual worlds. 


Many artificial chemistries lack verisimilitude because 
the symbols that the rewrite rules transform are not embed¬ 


ded in any physical space (Berry and Boudol 1990 Paun 


1998 Fontana and Buss 1999|l. Others have far greater 


resemblance to real physical systems (Laing |1977| Smith 
et al. 2003 Hutton 2004 j l. These assign symbols to posi¬ 
tions in a virtual world, restrict interactions to local neigh¬ 
borhoods, and rely on diffusion for data transport. 


Programs written in conventional programming lan¬ 
guages generally require a random access stored program 
(RASP) computer to host them[^ Because of program-data 
equivalence, RASPs permit relatively simple solutions to 
the self-replication problem based on refiection. Yet self- 
replicating programs written in conventional programming 
languages are (in effect) stuck in boxes; it makes no dif¬ 
ference whether it is one big box ( |Ray[ |1994jl or many 


little boxes interacting in a virtual world (Adami et al. 


1994| l; because they read, write, and reside in random ac¬ 
cess memories, the programs themselves are fundamentally 
non-physical. 

In the game of defining virtual worlds and creating self- 
replicating programs inside those worlds, there is a tradeoff 
between the non-contingent complexity of physical law and 
the purely contingent complexity of the initial conditions 
that define a program and its self-description. We propose 
that the ratio of contingent and non-contingent complexity 


is positively correlated with the property that Pattee (19951 


calls semantic closure. Ideally, we would like to pursue an 
approach that combines the expressiveness of conventional 
programming languages with the physical verisimilitude of 
ACAs while maximizing the ratio of contingent and non¬ 
contingent complexity. To do this, we need to break pro¬ 
grams out of their boxes; we need reified programs that as¬ 
semble copies of themselves from reified building blocks; 
we need to imagine programs as polypeptides. 

Superficially, there is a similarity between the sequences 
of instructions that comprise a machine language program 
and the sequences of nucleotides and amino acids that com¬ 
prise the biologically important family of molecules known 
as biopolymers. It is tempting to view all of these sequences 
as ‘programs,’ broadly construed. However, machine lan¬ 
guage programs and biopolymers differ in (at least) one sig¬ 
nificant way, and that is the number of elementary building 
blocks from which they are constructed. The nucleotides 
that comprise DNA and RNA are only of four types; the 
amino acids that comprise polypeptides are only of twenty; 
and while bits might conceivably play the passive represen- 


'See Williams (2014 1 for a notable exception. 
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Figure 1: Framework proposed in this paper (left). Funda¬ 
mental dogma of molecular biology (right). 

tational role of nucleotides, they can not play the active func¬ 
tional role of amino acids; this role can only be played by 
instructions. While the instruction set of a simple RASP can 
be quite small, the number of distinct operands that (in ef¬ 
fect) modify the instructions is a function of the word size 
of the machine, and is therefore (at a minimum) in the thou¬ 
sands]^ The implication for the study of self-replicating pro¬ 
grams is profound; while biopolymers can be assembled by 
physical processes from building blocks of a few fixed types, 
it is impossible to construct machine language programs for 
a RASP this way. 

DNA and RNA are copiable, transcribable and translat¬ 
able descriptions of polypeptides. DNA is (for the most part) 
chemically inert while polypeptides are chemically active. 
Polypeptides can not serve as representations of themselves 
(or for that matter of anything at all) because their enzymatic 
functions render this impossible. Information flows in one 
direction only. Watson and Crick ( 1953| l thought this idea 
so important that they called it “the fundamental dogma of 
molecular biology.” It is the antithesis of the program-data 
equivalence which makes reflection possible. See Figure 

Combinators are functions with no free variables. In this 
paper we show how programs in a visual programming lan¬ 
guage just as expressive as machine language can be com¬ 
piled into sequences of combinators of only forty two types. 
Where machine language programs would use iteration, the 
programs that we compile into combinators employ non¬ 
determinism. The paper culminates in the experimental 
demonstration of a computational ribosome, a ‘machine’ in 
a 2D virtual world that assembles programs from combina¬ 
tors using inert descriptions of programs (also comprised of 
combinators) as templates. 

^Although they play many roles in machine language programs, 
non-register operands are generally addresses. 


Reified Actors 

Actors are created using three different constructors: [ 
creates combinators, [ J"*" creates behaviors, and [ cre¬ 
ates objects. Like amino acids, which can be composed 
to form polypeptides, primitive combinators can be com¬ 
posed to form composite combinators. Behaviors are just 
combinators that have been repackaged with the [ ]+ con¬ 
structor. Prior to repackaging, combinators do not manifest 
their function; this might correspond (in our analogy) to the 
folding of a polypeptide chain into a protein. 

Objects are containers that can contain other actors. Each 
is one of four immutable types: [ ]o, [ ] i, [ ]2 and [ ] 3 . For ex¬ 
ample, [x, y, zji is an object of type two that contains three 
actors, X, y and z. Primitive combinators and empty objects 
have unit mass. The mass of a composite combinator is the 
sum of the masses of the combinators of which it is com¬ 
posed. The mass of an object is the sum of its own mass and 
the masses of the actors it contains. Since actors can neither 
be created nor destroyed, mass is conserved. 

Actors are reified by assigning them positions in a 2D vir¬ 
tual world. Computations progress when actors interact with 
other actors in their 8-neighborhoods by means of the be¬ 
haviors they manifest. All actors are subject to diffusion. An 
actor’s diffusion constant decreases inversely with its mass. 
This reflects the real cost of data transport in the (notional) 
ACA substrate. Multiple actors can reside at a single site, 
but diffusion never moves an actor to an adjacent occupied 
site if there is an adjacent empty site. 

As with membranes in |Paun ( 1998| l, objects can be nested 
to any level of depth. The object that contains an actor (with 
no intervening objects) is termed the actor’s parent. An ac¬ 
tor with no parent is a root. Root actors (or actors with the 
same parent) can be associated with one another by means 
of groups and bonds. Association is useful because it allows 
working sets of actors to be constructed and the elements of 
these working sets to be addressed in different ways. 

The first way in which actors can associate is as members 
of a group. All actors belong to exactly one group and this 
group can contain a single actor. For this reason, groups de¬ 
fine an equivalence relation on the set of actors. A group of 
root actors is said to be embedded. All of the actors in an em¬ 
bedded group diffuse as a unit and all behaviors manifested 
by actors in an embedded group (or contained inside such 
an actor) share a finite time resource in a zero sum fashion. 
Complex computations formulated in terms of large num¬ 
bers of actors manifesting behaviors inside a single object 
or group will therefore be correspondingly slow. Further¬ 
more, because of its large net mass, the object or group that 
contains them will also be correspondingly immobile. 

The second way in which actors can associate is by bond¬ 
ing. Bonds are short relative addresses that are automati¬ 
cally updated as the actors they link undergo diffusion. Be¬ 
cause bonds are short {L\ distance less than or equal to 
two), they restrict the diffusion of the actors that possess 



























them. Undirected bonds are defined by the hand relation 
H, which is a symmetric relation on the set of actors, i.e., 
H{x,y) = H{y,x). Directed bonds are defined by the previ¬ 
ous and next relations, P and N, which are inverse relations 
on the set of actors, i.e., P{x,y) = N(y,x). 

If the types of combinators and behaviors were defined 
by the sequences of primitive combinators of which they are 
composed, then determining type equivalence would be rel¬ 
atively expensive. For this reason, we chose instead to de¬ 
fine type using a simple recursive hash function that assigns 
combinators with distinct multisets of components to dis¬ 
tinct types; the hash values of composite combinators are de¬ 
fined as the product of the hash values of their components; 
primitive combinators have hash values equal to prime num¬ 
bers]^ Type equivalence for behaviors is defined in the same 
way, the types of combinators and behaviors being distinct 
due to the use of different constructors. Although this hash 
function is (clearly) not collision free, it is quite good and 
it has an extremely useful property, namely, that compos¬ 
ite combinators can be broken down (literally decomposed) 
into their primitive components by prime factorizationj^ 

Apart from composition, containment, group and bonds 
there is no other mutable persistent state associated with ac¬ 
tors. In particular, there are no integer registers. Primitive 
combinators exist for addressing individual actors or sets of 
actors using most of these relations. These, and other primi¬ 
tive combinators for modifying actors’ persistent states will 
be described later. 

Non-deterministic Comprehensions 

Sets can be converted into superpositions using the non- 
deterministic choice operator ( [McCarthy|1963| l: 

amb{} = 0 
amb {.x.y...} = {x,y...). 

When amb is applied to a non-empty set, it causes the branch 
of the non-deterministic computation that called amb to fork. 
Conversely, empty sets cause the branch to fail. When a 
branch fails, the deterministic implementation backtracks. 

Monads are an abstract datatype that allows programmers 
to define rules for composing functions that deviate from 
mathematically pure functions in prescribed ways. Multival¬ 
uedness (represented by sets) and non-determinism (repre¬ 
sented by superpositions) are just two examples. The monad 
interface is defined by two operations called unit and bind. 
Unit transforms ordinary values a into monadic values, e.g., 
uniU X = (x) where A is the superposition monad. Functions 

^We could instead use nested objects to label combinators so 
that they can be compared. This would be like using codons con¬ 
structed from nucleotides to label amino acids in transfer RNAs. 

“^This is analogous to the function in the cell which is per¬ 
formed by the molecular assemblies called proteasomes and in the 
organelles called lysosomes. 


like unit that take ordinary values and return monadic val¬ 
ues are termed monadic functions. Bind (the infix operator 
‘ »= ’ in Haskell) allows monadic functions to be applied 
to monadic values. This permits monadic functions to be 
chained; the output of one provides the input to the next. 

Monads are intimately related to set builder notation or 
comprehensions. By way of illustration, consider the follow¬ 
ing non-deterministic comprehension that fails if n is prime 
and returns a (non-specified) factor of n if n is composite; 


[x\x & {\ . .n—\) ,y & {\ . .x), xy = n] 


Wadler ( 1990|l showed that notation like the above is syn¬ 


tactic sugar for monadic expressions and described a pro¬ 
cess for translating the former into the latter. Comprehen¬ 
sion guards, e.g., xy = n, are translated using the function 


guard^ True = unit^ _L 
guard^ False = zero^ 

where M is the monad and _L is undefined. Because zeroA 
is (), if guard is applied to False, the branch of the com¬ 
putation that called guardA fails. Conversely, if guards is 
applied to True, the branch continues. Using this device, the 
primality comprehension can be desugared as follows 

An ^ (uniu n »= uniu • (— 1) »= amb • i »= 

1 / A A A 

Ax -> (uniu X »= amb • i »= uniu • (x x) »= 

A A 

uniU-(= n) »= guard^ »= uniUx)) 
where (lx) equals {1.. x}. 

From Comprehensions to Dataflow Graphs 

Recall that our goal is to create programs comprised solely 
of combinators. To maximize composability, these combi¬ 
nators should be of a single type, yet the desugared compre¬ 
hension above contains functions of many different types. 
However, if sets are used to represent sets, singleton sets are 
used to represent scalars, and non-empty and empty sets are 
used to represent True and False, then the type signatures 

W-({«}) 

{a} ^ {a} ^ ({fl}) 

are general enough to represent the types of all functions in 
the desugared comprehension. To prove this, we first show 
that amb can be lifted to the type, {a} ({a}), as follows; 

amb'O = 0 

amb'{x,y...} = ({x},{y} ...). 

We then devise a way to lift functions like i with type, a 
{a}. This is accomplished using the bind operator ( »=s) 
for the set monad S. The bind operator behaves like this 

{x,y...} >>= / = /xU/y U... 
















and can be defined as follows 


5 

(»= /) =join5- (mapx/) 
where join^ is right fold of (U) and 

map 5 /{x,y...} = {/x, fy 
Bind can then be used with unit a to lift i into a function 


0-=<£)-=-' 


-KS> 


1 h=-E}-=-&=-&=<!) 


l' = unit/i • ( »= i) 

with the type, {a} ({«}), as demonstrated below 

i'{x,y...} = (ixUlyU...). 


Next we define two functions of type, {a} ({«}), to re¬ 

place guard. The first causes a computation to fail when its 
argument is empty while the second does the opposite; 


some' {} 
some ’ { X, y...} 
none'{} 
none' { x, y...} 


0 

({x,y...}) 

({}) 

()• 


Finally, the desugared comprehension contains functions 
like (—1), (x) and (=) that map scalars to scalars, yet we 
need functions that map sets to superpositions of sets. For¬ 
tunately, sensible lifted forms for these functions are easily 
defined. For example 

pred' = uniU • (mapj (—1)) 
times 'x'y' = uniU {Jc x y | .r G x', y S y'} 
equals 'x'y' = unit^ { x | x G x', y G y', x = y } 

where x' and y' are of type {a}. Using these lifted functions 
and those defined previously, the non-deterministic compre¬ 
hension for deciding primality can be translated as follows: 


Figure 2: Non-deterministic dataflow graph for deciding pri¬ 
mality (top). Dataflow graph compiled into a sequence of 
non-deterministic combinators (bottom). 


From Dataflow Graphs to Combinators 

One might assume that evaluation of dataflow graphs con¬ 
taining junctions would require an interpreter with the 
ability to create and apply anonymous functions or clo¬ 
sures. These would contain the environments needed to 
lookup the values bound to variable names introduced by 
A-expressions. Happily, this turns out to be unnecessary. In 
this section we show how dataflow graphs can be evaluated 
by a stack machine and define a set of combinators that can 
be used to construct stack machine programs. 

In general, combinators apply functions to one (or two) 
values of type {a} popped from the front of the stack and 
then push a result of type {a} back onto the stack. Since 
dataflow graphs are non-deterministic, the stack machine is 
also. This means that each combinator /" transforms a stack 
of sets into a superposition of stacks of sets 



[{«}]-([{«}])• 


Unary operators /' can be converted to combinators of type 
/" as follows: 


An'^ (uniUn' »= pred' »= i' »= amb' »= 


/" (x':.")=map^(:.")(/'x') 


Ax' ^ (uniUx' »= i' »= amb' »= 

A A 

(times'x') »= (equals'n') »= some')) 

where n' is of type {a}. This was a lot of work, but we 
have reaped a tangible benefit, namely, non-deterministic 
comprehensions can now be rendered as dataflow graphs. 
In Figure 1^ (top) boxes with one input have type signatures 
matching /' and boxes with two inputs have type signatures 
matching g'. Arrows connecting pairs of boxes are instances 
of ( »=a). Junctions correspond to values of common 
subexpressions bound to variable names introduced by A- 
expressions. Lastly, (A) is amb’ and is some’. This result 
is important because, without the amenity (provided by all 
general purpose programming languages) of being able to 
define and name functions, comprehension syntax quickly 
becomes unwieldy. For this reason, we make extensive use 
of dataflow graphs as a visual programming language in the 
remainder of this paper. 


where stack s” is of type [{a}], map a maps functions over 
superpositions and (: s”) is the function that pushes sets onto 
the front of s". Note that /" does not change the length of 
the stack; it consumes one value and leaves one value be¬ 
hind. Binary operators g’ can also be converted to combina¬ 
tors of type /" as follows: 

g” (x':y':s")=map^(: s”) {g’x’y’). 


Note that g" decreases the length of the stack by one; it con¬ 
sumes two values and leaves one value behind. The combi¬ 
nator forms of some ’ and none ’ are slightly different; they 
do not push a result onto the stack. Instead, they pop the 
stack when a non-deterministic computation has yielded a 
satisfactory intermediate result (whether that is something 
or nothing) and fail otherwise: 


some 


(x':5") = { 


0 

uniu s” 


ifx' = {} 
otherwise 










none” (;c': i”) = 


unit^i" \f x'= {} 
0 otherwise. 


Multiple functions can be applied to a single value by push¬ 
ing copies of the value onto the top of the stack and then ap¬ 
plying the functions to the copies. This preserves the value 
for future use and eliminates the need for closures. Accord¬ 
ingly, we define a set of combinators that copy and push 
values located at different positions within the stack 


x^'(5") =uniu((5"!!(n-k)):5") 


where k G {0..9}, (!!) returns the element of a list with a 
given index, and n is the length of s ". With this last puz¬ 
zle piece in place, we can finally do what we set out to do, 
namely, compile the comprehension for deciding primality 
into a sequence of combinators 


,, A .. A .. A .. A .. A .. A . 

Xq >=> pred >=> I >=> amb >=>X[ >=> l >=> amb 

>=> X ” >=> times " >=> x (J >=> equals " >=> some " 
where (>=>) is Kleisli composition 


f >=> g = ( »= g) ■ f 

In Figure (bottom) boxes are functions with type signa¬ 
tures matching f". Arrows connecting pairs of boxes are 
instances of ( >=>a). Lastly, is amb" and is some". 

Reified Actor Comprehensions 

The last two sections of the paper demonstrated that: 1) 
Non-deterministic comprehensions can be represented as 
dataflow graphs; and 2) Dataflow graphs can be compiled 
into sequences of combinators that evaluate comprehen¬ 
sions by transforming the state of an abstract machine. In 
this section we describe a visual programming language for 
specifying behaviors manifested by reified actors in a vir¬ 
tual world. All results from prior sections apply. How¬ 
ever, non-determinism must be combined with other effects 
to construct a monad more general than A which we call 
R (for reified actor). In addition to representing superpo¬ 
sitions, monad R provides mutation of a threaded global 
state and data logging so that behaviors composed of com¬ 
binators can report the time they consume. The boxes of 
dataflow graphs with one and two inputs now have types 
{Actor} {{Actor})' and {Actor} {Actor} {{Actor})' 
where ()' is the type constructor of monad R. Arrows con¬ 
necting boxes are instances of ( »= r). Combinators now 
have type [{Actor}] ^ {[{Actor}])' and are composed with 
(>=>«)■ 

Combinators can be divided into the categories: genera¬ 
tors, guards, relations, and actions. Generators are unary 
operators that characterize sets of actors using the devices 
of groups, containment, bonds, and neighborhood (Table[^. 
They can be composed to address different sets. For exam¬ 
ple, an actor’s siblings can all be addressed using the sub¬ 
graph " —@ . Generators can also be composed with 


guards (Table |^. This can be used either to address single 
actors or to specify preconditions for actions. For exam¬ 
ple, the subgraph @ > (A) addresses a single sibling 

while the subgraph fails if the actor has a neighbor. 


Table 1: Unary generators. 


Name 

Abbrev. 

Definition 

hands 

1 

actor sharing hand with x 

nexts 

> 

actor with directed bond from x 

prevs 

< 

actor with directed bond to x 

bonds 


union of hands, nexts and prevs 

neighbors 

# 

actors in neighborhood of v 

contents 

@ 

actors that are contained in v 

parents 


actor that contains x 

members 

* 

members of group of x 

others 

+ 

members of group of x but not x 


Table 2: Unary guards. 


Name 

Abbrev. 

Definition 

amb 

A 

non-deterministic choice 

some 

S 

Fail if empty. 

none 

N 

Fail if non-empty. 


Relations exist for testing equality and type equivalence 
(Table 1^. They are binary operators and are generally ap¬ 
plied to singleton sets in combination with guards to specify 
preconditions for actions. When applied to non-singleton 
sets, the equality operator and its negation compute set in¬ 
tersection and difference. 


Table 3: Binary relations. 


Name 

Abbrev. 

Definition 

same 

= 

set intersection 

different 

! = 

set difference 

similar 


all X type equivalent to some y 

dissimilar 

! ~ 

all X type equivalent to no y 


Actions for modifying actors’ persistent states are the fi¬ 
nal category of boxes in dataflow graphs. Actions are ren¬ 
dered as grey boxes and are executed only after all non¬ 
actions have been evaluated and only if no guard has failed. 
All actions are reversible but the masses and types of primi¬ 
tive combinators and empty objects are immutable. The full 
set of unary and binary actions is shown in Tables]^ and 
Where data dependencies determine order of execution, 
this order is followed. Where it would otherwise be under¬ 
determined, two devices are introduced to specify execution 






























Table 4: Unary actions. 


Name 

Abbrev. 

Dehnition 

drop 

!| 

Delete hand of x. 

unbond 

! > 

Delete directed bond from x. 

unbond ' 

! < 

Delete directed bond to x. 

quit 

* -> 

Remove x from its group. 

exit 

@-> 

Place X inside its parent’s parent. 

digest 

>!> 

Reduce x to primitive combinators. 

on 

/ 

Replace combinator with behavior. 

off 

\ 

Replace behavior with combinator. 


Table 5; Binary actions. 


Name 

Abbrev. 

Dehnition 

grab 

1 

Create hand between x and y. 

bond 

> 

Create directed bond from x to y. 

bond' 

< 

Create directed bond from y to x. 

join 

-> * 

X joins group of y. 

eat 

->@ 

Place X inside y. 

compose 

>=> 

Replace x with x >=> r y. 

swap 

% 

X and y swap positions and bonds. 


order. First, all actions return their first (or only) argument 
if they succeed. This allows one action to provide the input 
to a second and (when employed) introduces a data depen¬ 
dency that determines execution order. Second, execution 
order can be explicitly specihed using dotted control lines. 

In addition to non-determinism and mutable threaded 
state, instances of monad R also possess a data logging 
ability that is used to instrument combinators so that be¬ 
haviors comprised of them can report the time they con¬ 
sume. Because the unit of time is one primitive operation of 
the abstract machine, most primitive combinators increase 
logged time by one when they are tTin. Significantly, this 
occurs on all branches of the non-deterministic computation 
until a branch succeeds so that the full cost of simulating 
non-determinism on a (presumed) deterministic substrate by 
means of backtracking is accounted for. Two kinds of com¬ 
binators increase logged time by amounts other than one. 
Since the time required to compute set intersections and dif¬ 
ferences is the product of the sets’ lengths, for binary re¬ 
lations, the logged time is increased by this value instead 
(which equals one in the most common case of singleton 
sets). Finally, actions that change the position of an actor, 
e.g., join, pay an additional time penalty proportional to the 
product of the actor’s mass and the Li distance moved. 

Ideally, the actor model described in this paper would be 
reified as an ACA so that self-replicating programs consume 
real physical resources. Actors in an embedded group might 
share a single processor or might jointly occupy a 2D area of 
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Figure 3; Behaviors defining a ribosome. 


hxed size that collects a hxed amount of light energy per unit 
time. The effect would be the same; the number of primitive 
abstract machine operations executed per unit time by the 
processor (or in the area) would be hxed. 

For the time being, we implement the reihed actor model 
as an event driven simulation using a priority queue ( |Gille-| 
). Event times are modeled as Poisson processes 
with embedded groups and event rates are con¬ 
sistent with the joint consumption by actors in groups of 
hnite time resources. Events are of two types. When a 
diffusion event is at the front of the queue, the position of 
the group in its neighborhood is randomly changed (as pre¬ 
viously described). Afterwards, a new diffusion event as¬ 
sociated with the same group is enqueued. The time of 
the new event is a sample from a distribution with density 
/d(0 = where m is mass, s is distance, 

and D is the ratio of the time needed to execute one primi¬ 
tive operation and the time needed to transport a unit mass a 
unit distance. As such, it dehnes the relative cost of compu¬ 
tation and data transport in the ACA substrate]^ 

When an action event is at the front of the queue, a be¬ 
havior is chosen at random from among all actors of type 
behavior in the group. After the behavior is executed, the 
time assigned to the new action event is a sample from a 
distribution with density / 4 (f) = where c is the time 

consumed by the behavior. 

Computational Ribosomes 

Biological enzymes can be reihed as chains of nucleotides 
or amino acids. The hrst can be read and copied but are spa¬ 
tially distributed and purely representational; the second are 
representationally opaque but compact and metabolically 
active. Comprehensions can be compiled into sequences 
of primitive combinators and reihed in analogous ways. A 

^In all of our experiments D equals 10. 
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plasmid is a compiled comprehension reified as a chain of 
actors of type combinator linked with directed bonds: 

p =Nr > [ciT > • • • > [cw-ir 


from the genetic information that ribosomes translate into 
enzymes. A factory can be constructed by reifying the be¬ 
haviors defined in Figure as enzymes and placing them 
inside an object with a type distinct from that of ribosomes: 


where (>) is a directed bond and | ] denotes an actor that 
is reified at the root level. A single undirected bond (not 
shown) closes the chain and marks the plasmid’s origin. 
While plasmids are spatially distributed chains of many ac¬ 
tors, enzymes are single actors of type behavior: 

E = |co >=>R Cl >=>R ■ ■ ■ >=>R CN-1 r ■ 


Biological ribosomes are arguably the most important 


component of the fundamental dogma (Watson and Crick 


19531. They translate messenger RNA into polypeptides us¬ 


ing a four stage process of association, initiation, elongation 
and termination. We can construct a computational ribo¬ 
some that will translate plasmids into enzymes by defining 
four behaviors with analogous functions (Figure]^, reifying 
the behaviors as enzymes, and placing them inside an actor 
of type object 


P — l^ribA; ^ribli FubEi ^ribT lo- 

Behavior ribA first checks to see if R possesses a self- 
directed bondj^ If so, ribA attaches R to the plasmid by 
adding it to the group of the initial combinator, |co]^. Next, 
ribi finds an actor in the neighborhood with type match¬ 
ing |co]^ and places it inside R. When R is at position n 
on the plasmid, ribE finds a neighbor with type matching 
[cn+i|^ and composes it with the combinator inside R, i.e., 
with [co >=>« • • • >=>R Cn]^. It then advances the position 
of to n -f 1. This process continues until R reaches N —I, 
at which point ribT promotes the combinator to a behavior, 
expels it, and detaches R from the plasmid. 

If a ribosome and a plasmid are placed in the world with a 
supply of primitive combinators, the ribosome manufactures 
the enzyme described by the plasmid 

R+Pb + Y,cf^bic)lcl R-\-Pi,-\-Ei, 

where C is the set of 42 primitive combinators and mi, (c) is 
the number of combinators of type c in Pt, and Eh, i.e., the 
plasmid and enzyme reifications of behavior b. 

Now that we have a ribosome, we need something to do 
with it. We could (of course) use ribosomes to synthesize 
the enzymes of which they themselves are comprised. How¬ 
ever, it would be more interesting if these enzymes were then 
used to construct additional ribosomes. To accomplish this, 
we need a ‘machine’ that will collect the finished enzymes 
and place them inside an object of the correct type. We call 
this machine d, factory. Factories are copiers of composi¬ 
tional information, which is heritable information distinct 

^Ribosomes without this bond are disabled and serve solely as 
models for factories, i.e., as compositional information. 


P — l^facA; ^facB; ^facY ^ ^facZi ^facZ^l 1 ■ 

Behavior/acA creates a directed bond with any unbonded 
non-empty object it finds in the factory’s neighborhood. 
This object and its contents serve as the model. Behav¬ 
ior/acB creates a second directed bond from the factory to 
an empty object with type matching the model. This ob¬ 
ject serves as the container for the product. Behavior facY 
moves behaviors from the neighborhood similar to those in 
the model into the product. Behavior/acZ recognizes when 
the product contains the full set of behaviors and deletes 
the bond connecting it to the factory. Behavior/acZ' does 
the same but also installs a self-directed bond on ribosomes 
that enables their association behaviors (elements unique to 
facZ' are yellow in Figure]^. 

As an initial experiment, we demonstrate mutual repli¬ 
cation of a mixed population of ribosomes and factories. 
Plasmids Ph encoding enzymes Eh comprising ribosomes 
and factories are placed in a 2D virtual world consisting 
of 64 X 64 sites together with a large surplus of ribosomes 
(r = 64) and single instances of factories with ribosome and 
factory models, Fr and Ff. The supply of primitive com¬ 
binators and empty objects is replenished as instances are 
incorporated into enzymes and products. Consequently, the 
concentration of consumables is held constant. Plasmids and 
consumables required for synthesis of factory enzymes are 
overrepresented relative to those for ribosomal enzymes: 

r/? + Fff-|-F/r-|-£gP;,-|-2|]o + 3|]i W 

—>■ (r -\-1)R-\-2Fr-\-2Fp -f-'Y.gPh 

where the multiset B = {b \ Eh S 2R U 3F }. We observe 
that the ribosomes synthesize the enzymes encoded by the 
plasmids and these are then used by the factories to construct 
additional ribosomes and factories. See Figure]^ 

Conclusion 

Fifty years after von Neumann described his automaton, it 
remains a paragon of non-biological life. The rules gov¬ 
erning CAs are simple and physical, and partly for this rea¬ 
son, the automaton von Neumann constructed using them 
is uniquely impressive in its semantic closure. Yet perhaps 
because RASPs are (in comparison with CAs) relatively 
well-appointed hosts, self-replicating programs in conven¬ 
tional programming languages seem somehow less convinc¬ 
ing. All self-replicating programs must lift themselves up 
by their own bootstraps, yet not all programs lift themselves 
the same distance. The field of programming languages has 
made remarkable advances in the years since von Neumann 
conceived his automaton. Modern functional programming 
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Figure 4: Behaviors defining a factory. 



Figure 5: Average increase in numbers of ribosomes and fac¬ 
tories (ten runs). Error bars show ± one standard deviation. 

languages like Haskell bear little resemblance to the ma¬ 
chine languages that are native to RASPs. In this paper, we 
have attempted to show that programs defined using seem¬ 
ingly exotic constructs like non-deterministic comprehen¬ 
sions can in fact be compiled into sequences of combinators 
with simple, well-defined semantics. Moreover, because 
they do not have address operands, these combinators can be 
reified in a virtual world as actors of only a few fixed types. 
This makes it possible to build programs that build programs 
from components delivered by diffusion using processes that 
resemble chemistry as much as computation. 
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